AITopics | anticorrelated noise injection

anticorrelated noise injection

Papers Simplified: »Anticorrelated Noise Injection for Improved Generalization«

#artificialintelligence, Aug-18-2022, 14:15:26 GMT

In this article, I will not explain to you all of the (exciting!) [...] Instead, I will provide you with some implementations and pictures that should make it possible to understand the gist of the paper. I also did my best to implement the optimizers mentioned in the paper, but use the code with care because I'm not an expert in this regard. To understand what Anti-PGD (Anti-Perturbed Gradient Descent) is about, let us briefly recap how GD and derived algorithms such as SGD and PGD work. Let us assume that we want to minimize a function f whose gradient is denoted ∇f(θ).
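The recap of GD and PGD can be sketched in a few lines of NumPy. This is only a minimal illustration, not the article's own code: the function names `gd` and `pgd`, the Gaussian noise with scale `sigma`, and the toy objective f(θ) = 0.5·‖θ‖² are all assumptions made for the example. SGD would follow the same pattern with `grad` replaced by a mini-batch gradient estimate.

```python
import numpy as np

def gd(grad, theta0, lr=0.1, steps=100):
    """Plain gradient descent: theta <- theta - lr * grad(theta)."""
    theta = np.asarray(theta0, dtype=float).copy()
    for _ in range(steps):
        theta = theta - lr * grad(theta)
    return theta

def pgd(grad, theta0, lr=0.1, sigma=0.01, steps=100, seed=0):
    """Perturbed GD: a GD step plus fresh, uncorrelated noise at every step.

    Gaussian noise with standard deviation `sigma` is an assumption here.
    """
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float).copy()
    for _ in range(steps):
        xi = sigma * rng.standard_normal(theta.shape)  # i.i.d. perturbation
        theta = theta - lr * grad(theta) + xi
    return theta

# Hypothetical toy objective f(theta) = 0.5 * ||theta||^2, whose gradient is theta.
def grad_f(theta):
    return theta

theta_gd = gd(grad_f, [1.0, -2.0])
theta_pgd = pgd(grad_f, [1.0, -2.0], sigma=0.01)
```

On this toy problem GD contracts toward the minimizer at the origin, while PGD ends up near it and keeps jittering at a scale set by `sigma`.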

anti-correlated perturbation, anticorrelated noise injection, paper simplified, (1 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.38)

Anticorrelated Noise Injection for Improved Generalization

Orvieto, Antonio, Kersting, Hans, Proske, Frank, Bach, Francis, Lucchi, Aurelien

arXiv.org Machine Learning, Feb-6-2022

Injecting artificial noise into gradient descent (GD) is commonly employed to improve the performance of machine learning models. Usually, uncorrelated noise is used in such perturbed gradient descent (PGD) methods. It is, however, not known if this is optimal or whether other types of noise could provide better generalization performance. In this paper, we zoom in on the problem of correlating the perturbations of consecutive PGD steps. We consider a variety of objective functions for which we find that GD with anticorrelated perturbations ("Anti-PGD") generalizes significantly better than GD and standard (uncorrelated) PGD. To support these experimental findings, we also derive a theoretical analysis that demonstrates that Anti-PGD moves to wider minima, while GD and PGD remain stuck in suboptimal regions or even diverge. This new connection between anticorrelated noise and generalization opens the field to novel ways to exploit noise for training machine learning models.
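One way to picture "anticorrelated perturbations" is to build each perturbation as the difference of two consecutive i.i.d. noise draws. The sketch below assumes that construction; the abstract itself does not spell it out, so treat it as an illustration under that assumption. The name `anti_pgd`, the Gaussian noise, and the parameters `lr`, `sigma`, and `seed` are likewise hypothetical.

```python
import numpy as np

def anti_pgd(grad, theta0, lr=0.1, sigma=0.01, steps=100, seed=0):
    """GD with anticorrelated perturbations (assumed form: xi_next - xi_prev)."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float).copy()
    xi_prev = sigma * rng.standard_normal(theta.shape)
    for _ in range(steps):
        xi_next = sigma * rng.standard_normal(theta.shape)
        # Consecutive perturbations share xi_prev with opposite sign,
        # which is what makes them negatively correlated.
        theta = theta - lr * grad(theta) + (xi_next - xi_prev)
        xi_prev = xi_next
    return theta

# Empirical check of the anticorrelation on a long scalar noise sequence.
rng = np.random.default_rng(0)
xi = rng.standard_normal(100_001)
increments = np.diff(xi)  # increments[k] = xi[k+1] - xi[k]
lag1_corr = np.corrcoef(increments[:-1], increments[1:])[0, 1]
# In theory the lag-1 correlation of such increments is -0.5.

# The injected perturbations telescope: their sum is xi[-1] - xi[0],
# so the accumulated noise stays bounded instead of growing like a random walk.
accumulated = increments.sum()
```

Under this construction the total injected noise after any number of steps is just the difference between the latest and the first draw, whereas the uncorrelated noise of standard PGD accumulates over time.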

anti-pgd, anticorrelated noise injection, noise, (11 more...)

arXiv:2202.02831

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Switzerland > Basel-City > Basel (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)
(2 more...)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
